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Abstract 

We consider the problem of minimizing the size of a family of sets Q 
such that every subset of {1, . . . , n} can be written as a disjoint union 
of at most k members of where k and n are given numbers. This 
problem originates in a real-world application aiming at the diversity 
of industrial production. At the same time, the minimum oi\Q\ so that 
every subset of {1, . . . , n} is the union of two sets in Q has been asked 
by Erdos and studied recently by Fiiredi and Katona without requiring 
the disjointness of the sets. A simple construction providing a feasible 
solution is conjectured to be optimal for this problem for all values of 
n and k and regardless of the disjointness requirement; we prove this 
conjecture in special cases including all (n, k) for which n < 3fc holds, 
and some individual values of n and k. 

Keywords: Turan type problems, extremal problems in graphs and 
hypergraphs, diversity, semi-finished products. 



1 Introduction 



The n-element set {1, . . . , n} is denoted by [n]. For two positive integers n. A;, 
a family Q of subsets of [n] is said to k-generate X C [n] if X is the disjoint 
union of at most k members of ^. It /c- generates the family TC C Vdn]) if 
it /c-generates every X £ TC. It is called an (n, A:)-generator if it generates 
the entire powerset ^^([^t.]), that is, if every non-empty subset of [n] can be 
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obtained as a disjoint union of at most k members of Q. This work aims at 
determining the (n, A;)-generators of minimum size. The size of a set is the 
number of its elements (synonyme of cardinality). 

Sets of size 1 are called singletons. All the singletons {i} (i = 1, . . . ,n) 
must be contained in any (n, A;) -generator. We call an (n, A;)-generator Q 
of minimum size optimal, and introduce the notation opt(n, A;) := \Q\. A 
generator can be represented by a hypergraph (family of sets) where the 
vertices are the elements of [n] and the hyperedges are the members of Q. 

As Zoltan Fiiredi reports, Paul Erdos [2] asked about the case k = 2 
allowing the target-sets to be not necessarily disjoint unions of two members 
of Q. He conjectured that optimal generators consist of all the non-empty 
subsets of Vi and V2 , where Vi , V2 is a partition of [n] into two almost equal 
parts. Since every subset of [n] is the disjoint union of two sets in this 
generator, it is implicit in this conjecture that the optimum value does not 
depend on whether the two sets in the definition are required to be disjoint 
or not. 

Erdos also considered the problem of generating only sets of size at most 
s, where s is a positive integer. Fiiredi and Katona investigated this latter 
problem in [3j. For s < 2 the problem is void, and for s = 3 the problem is 
equivalent to Turan's theorem [6]. For s < 4, n > 8 they establish that the 
cardinality of an optimal generator is n + (2 ) — [|rej . When s < 4 it does 
clearly not matter whether the two sets are required to be disjoint or not. 
(The same may be true for s > 4 see Section [2l but we cannot prove this.) 
For all s > 4 the problem is apparently open. 

The same questions have been asked independently for optimizing the 
diversity of production in the motorcar industry. To answer market require- 
ments, many companies want to reduce the delay between the command and 
the delivery of a finished product, in the context of offering a large choice 
for the possible options of these products. The industrial problem that has 
to be faced is the following: determine the semi-finished products - each of 
which corresponds to a set of options - that must be stocked in order to 
be able to assemble any possible finished product in at most a given num- 
ber of operations pLj. This latter constraint guarantees an assembly time 
that does not exceed a desired time of delivery. The aim is to minimize 
the size of the stock under this constraint. This is equivalent to finding 
an optimal (n, fc)-generator, where n is the number of options, and k the 
maximum number of semi-finished products that can be assembled. From 
the viewpoint of industrial technology the disjointness constraint cannot be 
relaxed, and it is better to be able to generate all subsets. Refining these 
constraints, the optimization problems that can be stated occur to be too 
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difficult (NP-hard, see Section [5|); on the other hand, these rigid require- 
ments bring us to the prefixed constraints of extremal combinatorics versus 
the flexible inputs of algorithmic problems. These questions lead directly to 
beautiful and seemingly difficult mathematical problems. 

The basic problem studied in this article has been mentioned by the 
first author in the activity report of the project "decision making under 
uncertainty" at the Centre for Advanced Study of Oslo, in 2000-2001. Con- 
jecture [1] below is explicitly mentioned in [Ij independently of Erdos [2]. 
However, the only result about this problem so far seems to be [3]. 

In Section [2] we introduce the main construction and provide the related 
conjectures, remarks and some other preliminaries, including the relation of 
the problem to the Turan number. In Section [3] and Section [J] the main 
results of the paper and their proofs are presented, where Section [3] is an 
auxiliary section collecting general facts about the critical situation when 
for some n, k, v, Q is not an optimal (n, k) generator, but ^ — u is an optimal 
(n — 1, A; ) -generator. Finally, in Section [5] we show that natural refinements 
of the problem in the spirit of combinatorial optimization are NP-hard, and 
prove on the other hand that the construction provides a generator that 
does never exceeds a small constant times the optimum. In the Appendix 
we show some more results concerning the case k = 2, which enabled us to 
finish some more concrete particular cases of the conjecture. 

2 Construction 

A natural way of constructing a generator is to partition the set [n] into k 
parts and to include all the non-empty subsets of each part in the generator. 
The cardinality of such a generator is minimum when the sizes of the parts 
differ by at most one. 

More formally, let p := p{n,k) := [^] and r := r(n,k) such that n = 
pk — r with < r < A;. Let Vi, . . . ,Vk be a partition of [n] into r sets of 
size p — 1 and k — r sets of size p. The generator we are constructing for all 
n,k gTN is: 

CONSTR(n, k) := {V{Vi) U • • • U 7^(Vfc))\{0}, 

where V is an arbitrary set. The cardinality of such a generator is 
constr(n, k) := r x (2^"^ - 1) + {k - r) x (2^ - 1). Note that 

constr(n, k) = constr(n — l,k) + 2^~^, 

and this simple recursive formula seems to be useful to keep in mind. It is 
sufficient to prove the same recursive formula for opt(n, k). 
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For instance we have constr(13, 5) = 27 for n = 13 and k = 5. 
Clearly, opt(n, A;) < constr(n, A;), and in fact the equality seems to hold 
always: 

Conjecture 1 For all n,k £ ]N the generator CONSTR(n, k) is optimal. 

Quite surprisingly this conjecture arose in production management, and 
for A; = 2 it is a posthumus conjecture of Erdos: 

Indeed, as Zoltan Fiiredi reports, Erdos |2], [3] asked the same ques- 
tion for k = 2 without requiring the disjointness of the sets. Could the 
same assertion be true for arbitrary k ? Let op(n, A;) denote the opti- 
mum for this problem. Clearly, op(n. A;) < opt(n, A;) < constr(n, /c), so if 
op(n. A;) = constr(n. A;) is true for some {n,k), there is equality throughout 
for this {n,k). These equalities would mean that disjointness is an irrele- 
vant requirement (in the sense that it does not change the optimum value). 
Could this be proved by some simple argument without necessarily settling 
the conjectures (see Conjecture [7|) ? In many results of the paper opt(n, A;) 
can be replaced by op(n. A;), see some remarks at the end of Section [3l 

Moreover, we also conjecture the unicity of the construction: 

Conjecture 2 For all n,k e IN such that p{n, k) / 2, CONSTR(n, A;) is 
the unique optimal (n, k) -generator. 

Trying to prove the preceding two conjectures inductively leads to the 
following conjecture that would imply both (see the next section): 

For a hypergraph Q C Vdn]) and z G [n] let G{z) := {g £ Q : z Q}. 

Conjecture 3 For all n,k £ W, for every (n,k)- generator Q, there exists 
z G [n] such that 

\g{z)\ > 2P("''=)-i. 

We prove that Conjecture [3] is true for p = 1,2,3 and {n,k) G 
{(7, 2), (8, 2)} for which p = 4. 

Notice that the partition underlying the construction is the same as that 
in Turan's theorem [6]. The two are actually related. The Turan number 
T{n,s,l), where n,s,l are three positive integers with / < s < n, is the 
minimum number of subsets of size I of a set of size n, such that each subset 
of size s contains at least one of them. In a generator, since every subset of 
size {I — l)k + 1 must contain a member of size at least I, there are at least 
T(n, {I — l)k + 1, /) members of size at least I. 
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Turan solved this problem for 1 = 2. If Z = 2, that is s = k + 1, his 
problem can be stated as follows: minimize the number of edges of a graph 
on n vertices so that the maximum number of pairwise non-adjacent vertices 
does not exceed k. Replacing every member g of a generator by a pair which 
is a subset of g, we always have this property. Turan proved that the unique 
minimum for this number is given by k cliques of almost equal size that 
partition the vertex-set. This partition coincides with the defining partition 
of the construction, showing that, the number of members of size at least 
two in a generator is at least the number of sets of size exactly two in Turan's 
construction. 

For I > 3, Turan conjectured that the partition into blocks still gives 
the solution to its problem, but this appears to be false. According to 
Sidorenko 0, for n = 9, s = 5, / = 3 with k = 2 and s = {I — l)k + 1 
Turan's construction provides (3) -|- (3) = 14 subsets of size 3 so that every 
5-tuple contains at least one of them, whereas the affine plane of order 3 
gives a solution with only 12 subsets with the same property. This example 
has been adopted by Fiiredi and Katona to find the minimum number of 
sets that 2- generate all 4-tuples of a set. 

Indeed, for n = 9, the set of minimum size that 2-generates all 4-tuples 
can be defined with the help of the affine plane with q = 3: take the lines of 
two parallel classes (6 triplets) and the 2-element subsets of the lines for the 
two remaining parallel classes (9 pairs for each, in total 18). The generator 
Q consisting of these 24 sets and the singletons 2-generate all the sets of size 
at most 4. Generalizing this construction Fiiredi and Katona [3j prove that 
it provides the best estimate for 2-generating all 4-tuples for all n. Compare 
24 with the size of the subset of C0NSTR(9, 2) capable to achieve the same 
task, the 2- and 3-tuples of C0NSTR(9, 2), Q + (^) + Q + (^) = 30. With 
30 sets - add to Q the 6 lines of the affine space that are not yet included 
in it - actually the set of 5-tuples can also be generated. 

We cannot continue in this direction, since finding the Turan number 
when Z > 3 is known as a difficult open problem, moreover a closer direct 
look using more than just the containments provides better lower bounds 
for the diversity problem in general (Section [5]) . 

3 Induction 

In this section we show some general facts that may help in inductive proofs 
provided we still have an optimal generator after the deletion of one or two 
elements. In order to analyse how opt(n,A;) changes as a function of n we 
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need tight lower and upper estimates. The only upper estimate we have 
is constr(n, k) and we will use it all the time; in the lower estimates two 
parameters of a hypergraph will play a role, the degree and the minimum 
transversal and the like: 

For a hypergraph Q C ^([n]) and a subset Z C [n] we define: 



g-z 

gnz 
guz 
g/z 



{g&g 
{g&g 

{gnz 

{gUZ 
{9\Z 



gnz = 
ZQg} 

g&g} 
g&g} 
g&g} 



0} 



One element sets Z = {z} are often replaced by z, when the usage is 
evident. Let us see some examples of occurrences of z G [n] and U Q [n]: 

g-z = {geg : z^g} = g\g{z) 

g/z = {g\{z} : geg} 

giz)/z = {g\{z} : g G g{z)} 

g{z)-U = {gi^g : zeg,gnU = %] 

g{z)lAU = {g\JU -.geg , z^g] 

The quantity 1^(^)1 is usually called the degree of z in the hypergraph 
g. Note that g{z)/z = H if and only if g{z) = {z] U H. 

We will actually need to refine our sets and our quantities. For a hyper- 
graph g C ^([Ti]) and p G IN, i = 1, . . . p, we denote ^* := {5 G ^ : l^l > 
i}; constr*(n,A:) := | CONSTR*(n, A;)|. 

In C0NSTR(13, 5) there are 13 hyperedges of size 1, 11 of size 2 and 
3 of size 3, so constri(13,5) = 27, constr2(13, 5) = 14, constr3(13, 5) = 3; 
constr*(n. A;) — constr*"'"^(n, /c) {i = 1, . . . ,p) is the number of members of 
size exactly i. 

We should not dream for anything stronger than Conjecture [3l which 
implies already all the other conjectures. However, we may need more details 
for a proof (as it will be the case for some of our results): 

Conjecture 4 For all n,k £ IN, for every (n,k)- generator g we have: 

(1) \g^\ > constr*(n, k) for all i = 1, . . . ,p. 

Since constr^(n, k) = constr(n, k) this conjecture contains Conjectured! 
When the average degree is not far from the maximum (if n = pk oi more 
generally, when r is small comparing to k) it also implies Conjecture El 
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Proposition 1 Ifn = pk and (1) holds for a hypergraph Q, then the average 
degree in Q is at least 2^^^, and every degree is equal to this number if and 
only if there is equality everywhere in (1). 

Proof. The average degree of Q is equal to the sum of the sizes in Q divided 
by n, which in turn is equal to 

n 
i=l 

since a set of size s is encountered here for the values i = 1, . . . , s, that is, 
exactly s times. 

If (1) holds, then this number is greater than or equal to the average 
degree of the hypergraph CONSTR(n, A;), which is equal to 2^"^, since all 
degrees are equal to this number. Therefore all degrees are equal to 2^-^ if 
and only if there is equality everywhere in (1), as claimed. □ 



Proposition 2 For all i = I, . . . ,p : \{g & CONSTR(n, k) : \g\ = i}\ 
constr*(n, k) — constr*+-^ (n, k) = r^ , ^ + (A; — r) . 



□ 



If H C V{[n]) is a hypergraph, a transversal is a set that meets all 
members of 7i, and T{7i) denotes the minimum size of a transversal. If 7i 
has m disjoint members, then clearly t{TC) > m. liTi contains the empty 
set, it has no transversal, we define then T(7i) = oo. 

Generators can be characterized in term of transversals, by the following 
easy but useful proposition: 

Proposition 3 Let Q C V{[n]) he an {n,k)- generator, and i £ {!,. . . ,p}. 

Then t{G^) > k(p — i + 1) — r, and this bound is tight. 

Proof. Suppose G is an (n, A;)-generator, and T C [n], \T\ < k{p — i + l) — r. 
Then |y — T| = n — \T\ > kp — r — {k{p — i + 1) — r) = k[i — 1), so in 
a partition into k elements there is a part of size at least i, so T is not 
a transversal of and the proposition is proved. The equality holds for 
g = CONSTR(n,A:). □ 

The extreme case i = p oi Conjecture H] is now easy, and we will need it: 
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Proposition 4 If Q is an {n,k)- generator, then \QP\ > constr^(n, A;) = 
k — r = n — [p— l)k, and if the equality holds Q contains exactly k — r sets 
of size at least p, and they are pairwise disjoint. 

Proof. Apply the preceding proposition to i = p: \QP\ > t{QP) > k — 
r = n — {p — l)k, and if the equality holds throughout, then in particular 
\QP\ = t{QP), that is, all the sets of are pairwise disjoint. □ 

We prove now that Conjecture\^ implies ConjectureUl and Conjecture\^ 
The following lemma deduces the optimality of the construction - that is, 
Conjectured]- by induction on n if and only if there always exists an optimal 
generator containing a vertex of degree at least 2^~^ (which is somewhat 
weaker than Conjecture [3l see Conjecture [5] below.): 

Lemma 1 Let Q he an optimal {n, k) -generator, z G [n], \G{z)\ > , and 
assume constr(n — l,k) = opt(n — l,k). Then: 

\G{z)\ = 2^~^, constr(n, A;) =opt(n, A;). 

Proof. Since G — z generates V{[n] \ {z}), it is an (n — 1, fc)-generator: 

opt(n, k) = \G\ = \Giz)\ + \G-z\> 2f-^ + opt(n - 1, k) 

= 2^""*^ + constr(n — 1, A;) = constr(n. A;) 
so there is equality everywhere. □ 

As a consequence, we see that Conjecture [1] follows recursively for (n. A;) 
if we know Conjecture [3] for all (n'. A;), k < n' < n. 

This recursion raises the question of analysing "the moment when a 
generator deviates from the construction, while n is increased and k is fixed" . 
(We will see that Conjecture [3] is true if n < 3A;). In the construction there 
are vertices z for which CONSTR(n, k) — z is isomorphic to CONSTR(n — 
1, A;). The following theorem shows that |^(^;)| with G — z = CONSTR(n — 
1, A;) has to pay a "high price" for essentially deviating from the construction: 

If is a hypergraph on [n], z S [n] and z ^ U [n], we say that z sees 
U if Giz) nU = V{U). Furthermore it strongly sees U if G{z) D {z]VAV{U). 

Theorem 1 Let G ^ 'P([ra]) he an {n,k)- generator, z G [n], and suppose 

G-z^V{Vi)\J---^V{Vk) 
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for a partition {Vi, . . . , Vk} (V^ 7^ 0, z = 1, . . . , A;) of [n\ \ {z}. Then there 
exists 1 <i < k, let it be i = 1, such that z sees Vi, moreover, if it does not 
strongly see Vi, then \Q{z)\ > 2l^il + m — 1, where m := minj=2,...,fe {Vil- 

Note that since Q — z generates [ra] \ z, in fact the equality holds in the 
condition. Introduce the notation U := {U C Vi,{z} UU ^ G}. Then z 
does not strongly see Vi if and only if 7^ 0; {z} G G implies ^ ^U, and 
therefore U has a nonempty member which is inclusionwise minimal. 

Proof. Suppose for a contradiction that the first part of the theorem is 
false, that is, for alH G {1, . . . , A;}, there exists G V{yi)\{Q{z)r\Vi). Since 
{z} G ^ we have G Q{z) U Vi, so / for all i = 1, . . . , /s. 

Let now Z := {z} U ai U • • • U a^. The set Z is generated by at most k 
members of Q, exactly one of which, - denote it by 51 - contains z. Clearly, 
gr\Vi C aj, and g ai because of the definition of aj (i = 1, . . . , n). So Z\g 
still contains an element from each Vi (i = 1, . . . ,k), and therefore cannot 
be generated by at most k — 1 members o{ Q — z C. V{Vi) U • • • U 7-" (14). 
This contradiction proves the first part of the theorem. That is, we can 
now assume G{z) nVi = V{Vi), define U like before the proof, and note: if 
9 e G{z), gnVi = U eU then g meets [n] \ (Vi U z). 

To prove the stronger inequality of the theorem, let U E U be minimal 

in U\ as noted [/ 7^ 0. Define 

g^u ■■= {9 e G{z) :9nVi = U} = G{zUU)- {Vi \ U), and 
Gcu ■■= {9 e G{z) : gnVi CU} = {G{z) - (Fi \ U)) \ G=u- Clearly, 
G=u n Gcu = 0- Let T := t{G=u /{U U z)), that is, r is the minimum size of 
a set disjoint of C/ U 2; that meets each member of G=u- This minimum is 
finite, since as noted, each member of G=u has an element outside U. Note 
also that \'H\ > t{H) holds whenever the latter is finite. Therefore we can 
suppose T < m without loss of generality, since otherwise \G=u\ >. t > m, 
and 

(ineql) \G{z)\ = \G{z) \ G=u\ + \G=u\ > (2l^^l - 1) + m, 

and nothing else remains to be proved. 

Claim: \Gcu\ > 2l^l + 2"*-^ - 2. 

Since U & U is minimal, z U {V{U) \ U) C. Gcu, so we know already 
2l^l — 1 elements of Gcu- It suffices to show now that Gcu has at least 
2m-T _ I elements that meet [n] \ Vi. 

Let C be a transversal of G=u/{U Uz),\C\=t. Then C CV2U . . .UVk- 
Now the condition of the theorem is satisfied for G — {{Vi \U)L) C), with 
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the same z, and with the partition {U, V2 \ C", ■ ■ ■ , Vfc \ C}: we already know 

?7 / 0, and because of |C| = r < m, \ C / 0, (f = 2, . . . , fc). 

Since U £U and C is a transversal of Q^u / (U \J z), Q{z) — (Vi \ [/) U (7 = 
Qc.u — C. Since z does not see U , by the already proven first assertion of 
our theorem it does see \ C for some i = 2,. . . ,k. Let i = 2: V2 \ C 
has at least m — t elements, and therefore V{V2 \ C) has at least 2"*""^ — 1 
non-empty members. 

Using that z sees Vi, and then applying the Claim and the inequality 

2m-T > ^ _ ^ _|_ ]^ gg^. 

1^(^)1 = \Q{z)\{g=u^gcu)\+\Q=u\+\Qcu\ > 2l^il-2l^l+r+2l^l+2— -2 > 
> 2l^il + T + (m-T + l)-2 = 2l^il + m - 1. 

□ 

The equality case of the bounds is worth analyzing also in hope of gains 
in the estimates: the gains allow to deduce stronger bounds on the degree 
from weaker bound, and therewith the optimality of CONSTR(ra, k) for some 
n and k. In the following analysis and corollary we will suppose Q C 7^([n]) 
is an (n, /c)-generator, z G [n], and Q — z C 'P(Vi)U- • -UViVk) for a partition 
{Vi, . . . , Vk} (14 7^ 0, i = 1, . . . , /c) of [n] \ {z}] we denote n, m the smallest 
and the second smallest size among the sizes {\Vi\ : i = I,... ,k} of the 
partition classes. 

Under the condition of the theorem a first estimate is > 2^, since 

z sees one of the classes. The theorem claims that there is equality in this 
bound if and only if z strongly sees one of the smallest classes. 

It is interesting that the bound jumps from 2l^il to 2l^il + m — 1 if z 
sees Vi but does not strongly see it. What are the conditions of the equality 
then ? 

Proposition 5 Suppose Q C 'P([n]) is an {n,k)- generator, z G [n], and 
g-zC V{Vi)U---LlV{Vk) for a partition {Vi, . . . ,Vk] (Fi / 0, i = 1, . . . , fc) 
of [n] \ {z}. If z sees Vi hut does not strongly see it, that is, U then the 
equality holds in the hound 

(ineq2) > 2l^il + m - 1, 

if and only if there exists 1 < i < k, let it be i = 2 such that IV2I = rn, 
V2 =: {^^1, . . . ,Vm} and choosing the indices appropriately, one of (i)-(iii) is 
true: 
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(i) There exists U ^ Vi such that with Ui := {V{U)\{U}) and U2 = 
{UU{vi} : i = l,...,m},orU2 = {UU{v^} : i = I, . . . ,m-l}U{{vm}}, 

g{z)/z=UiiJU2. 

ill) m = 2,U V{Vi) is arbitrary, g=u := U U {vi} {U E U),and 
G{z)/z = {V{U) \ U) U {g=u :UeU}U M 

(iii) m = l,U is arbitrary, and Q{z)/z = {V{U)\U)'J{g=u ■ U G U}, where 
g=U is the union of U and an arbitrary non-empty set of elements that 
form singleton classes. 

Proof. Suppose the condition of Theorem [1] is satisfied, and (ineq2) is 
satisfied with equality. Then m < t, since m > t would imply that (ineql) 
would also be satisfied with strict inequality, and then so would be the 
identical (ineq2). To have equality in the claim, G{z) cannot contain a set 
that meets a partition-class of size bigger than m different from Vi. 

Consider now U as in the proof, and let U £ U. Let us now exploit the 
equalities in the inequalities of the proof of (ineq2) in the proof of Theorem[T] 
from the end backwards: in order to have equality in (ineq2), we need 
2m-T — ^ _ ^ _^ ^j^^ since m — r > 0, this holds if and only if m — r = 1, 
or m — r = 0. We will have to consider both the case t = m and r = m — 1. 

If m > 2 then |^=(/'| > 1 for all U' £ U, while in (ineql) we used the 
bound of 1 for all but one U £ U. So the strict inequality holds if > 1. 
If |ZY| = 1 the equality can hold, and the two cases corresponding to r = m 
and T = m — 1 are listed in (i) . 

If m = 2 and t = m, then again, |^=(/'| > 1 for all U' £ U, and the 
strict equality can hold only if = 1, included already in the previous 
case. However, if m = 2 and r = m — 1, then |^=r/'| = 1 is possible for all 
U' £ U, and precisely if the unique element of Q=jji is the Q=jji of (ii). So 
all the new cases where equality can occur for m = 2 are listed in (ii). 

If m = 1, then as noticed, all sets in G{z) must be included in the union 
of Vi and the partition classes of size m, that is, must be of the form given 
in (iii). It is easy to check that this is then sufficient: all sets of this form 
are (n, A;)-generators. □ 

We get the following corollary from the theorem and the above analysis 
of the equality. Recall the notations p and m. 
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Corollary 1 G Q ^(M) 'is an {n,k)- generator, z £ [n], andQ—z C 'P(T4)U 
• • • U7^(Vfc) for some partition {Vi, . . . , Vfc} (14 / 0, i = 1, . . . , A;) of [n] \ {z}. 
Then 

(ineqS) \g{z)\ > 2^"^ + m 

unless z strongly sees one of the classes, or one of (i), (ii), (Hi) holds. 

The following lemma states in addition to the optimality of the con- 
struction the unicity of optima - that is, Conjecture [2] - by induction on n 
if and only if every optimal generator contains a vertex of degree at least 
2P~^ (which is still somewhat weaker than Conjecture [3l see Conjecture [6|): 

Lemma 2 Let Q he an optimal {n,k)- generator, z G [n], \Q{z)\ > 2^"^ and 
p > 3; assume that CONSTR(n — l,k) is the unique optimal (n — l,k)- 
generator. Then Q = CONSTR(n, A;). 

Proof. By Lemma[TJ |^(2;)| = 2^^^, and \Q\ = constr(n, k), whence Q — z = 
constr(n, k) — 2^~^ = constr(n — 1, fe), and then by the condition, Q — z = 
CONSTR(n-l,A;). 

SoG - z = {P{Vi) U • • • U 7^(Vfc))\{0}, where {Vi,..., Vk} is a partition 
of [n] into parts of size p{n,k) and p{n,k) — 1. By Theorem [1] one can 
choose Vi so that either G{z)/z = V{Vi), or \G{z)\ > 2l^il + m - 1 with 
m = minj=2,...,fc \Vi\ = p{n, k) -1>2. 

In the first case, by optimality, Vi is a class of size p{n, /c) — 1 so that 
G = CONSTR(n, k) follows. If indirectly, the second case holds, then 

2P-1 = \g{z)\ > 2P-^ + m - 1 > 2P'^ + 1, 

and this contradiction finishes the proof. □ 

Modified as follows. Conjecture [3] becomes equivalent to Conjectured] by 
Lemma [H 

Conjecture 5 For all n,k € ]N there exists an optimal {n,k)- generator Q 
and z G [n] such that: 

(2) \g{z)\ > 2P("''=)-i. 

Modified as follows. Conjecture [3] becomes equivalent to Conjecture [2] by 
Lemma O 
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Conjecture 6 For all n,k £ IN, for every optimal {n,k)- generator Q there 
exists z G [n] such that (2) holds. 

the following implication between the conjectures : 
Conjecture [2] Conjecture [H 
Conjecture [T] 
- Conjecture [51 
' Conjecture O 

Let us also state the conjecture asserting that the disjointness require- 
ment does not change the optimum value. 

Conjecture 7 For all n,k £ IN: op{n,k) = opt(n, /c). 

So far all the simple Propositions, Lemmas and Conjectures hold without 
change if disjointness is not required and op is written instead of opt. This 
is not true though for Theorem [1] and its corollaries, including Lemma [2] 
and Proposition \5\ the reason being that we used in an essential way that 
at most one of the k disjoint sets contains a given z £ [n]. 

4 Case p < 3 

Recall the notation p = p{n, k) = \^~\ and n = pk — r with < r < A;. In 
this section we prove all the conjectures for p <2>. This is done in Theorem[2] 
for p <2, and in Theorem U] for p = 3. (In the Appendix we add to this the 
two first cases with p = A: (n, k) = (7, 2) and (n, k) = (8, 2).) 

Theorem 2 If p < 2, that is 1 < n < 2k, then op(n, k) = opt(n, k) = 
constr(n. A;), furthermore, for any (not necessarily optimal) (n,k)- generator 
Q, (1) holds, and there exists z £ [n] such that (2) holds. A generator Q 
is optimal if and only if it consists of all the singletons in [n] and n — k 
pairwise disjoint sets of size at least 2. 

In particular, the construction is the unique optimal generator if n < /c 
or n = 2k, but it is not unique if A; < n < 2k. However, if A; < n < 2A:, 
Conjecture m follows still easily, and it is also not an exception of Theorem [1] 
or the reformulation of its essential part in Lemma [21 useful for proving 
unicity; this case is an exception to unicity only because for m = 1 - and 
only in this case - Theorem [1] does not exclude other optimal solutions of 
the same size, and they indeed, exist, and are already mentioned in the (iii) 



We have thus 
Conjecture [3] =^ 
Conjecture [D =^ 
Conjecture [1] <^=^ 
Conjecture [21 <^=^ 
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case of equality. The reason for this is nothing more than the vahdity of 
2P~^ + m — 1 = 2P~^ in this case. 

This is also the only case when "Turan's bound" T{n, A; + 1, 2) is exact. 

Proof. Let G be an arbitrary (n, /c) -generator. It contains all the singletons, 
and if p = 1, that is, n < A; there is no need of more members. 

If p = 2, that is. A; + 1 < n < 2fe, then by Proposition HI l^^l > 
constr^(n, /c) = k — r = n — k, and the equality holds if and only if the 
sets of size at least 2 are disjoint. 

Conversely, suppose the hypergraph Q has n — k disjoint members of size 
at least 2 (/c + 1 < n < 2A;), and let us check that it is an (n, A;)-generator. 
Let S C [n],s := |5| > k. Then S misses at most n — s < k members 
of Q^, so it contains at least n — k — (n — s) = s — k members of G'^, all 
pairwise disjoint. So S can be generated by s — A; members of Q'^ plus at 
most s — 2(s — A;) = 2A; — s singletons. 

□ 



Theorem 3 If p = 3, that is 2k < n < 3k, then for any (not necessarily 
optimal) [n, k)- generator Q , (1) holds. 

Proof. We have (1) for i = 3 by Proposition U) \Q'^\ > constr^(n, k) = n—2k. 

Now we prove (1) for i = 2, by induction on n — 2A;. By Theorem [2] it is 
true for n = 2k. For the sake of easier understanding, we first do the proof 
separately for n = 2k + 1, using it for n = 2k: For all z G [2k + 1] we have 
\Q'^ — z\ > constr^(2fe. A:) + 1 = A: + 1, otherwise we are done by Lemma [21 
Now 

> {2k + l){k + l), 

and in this sum every member of is counted at most 2A; — 1 times, so 
l^^l ^ + 1) = iiT^if« + 1) > k + 2. (For an easier look at it we 

used here that multiplying a number x by ^j*!^^^ it increases by more than 
1 if and only if x > A; — 1/2.) Since constr'^(2A; + 1, A;) = constr^(2A:, A;) + 3, 
l^^l > A; + 3 = constr2(2A;, A;) + 3 = constr2(2A: + 1, A:), as claimed. 

Similarly, for an arbitrary (n, /c)-generator, 2A; + 1 < n < 3A;, we have 

\g^\ > —^(opt(n - 1, A;) - (n - 1) + 1) > opt(n - 1, A:) - (n - 1) + 2, 
n — 2 

since opt(n — 1,A;) > and the statement follows then using 

constr^(n. A:) = constr^(n — 1, A;) + 3. □ 
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We do not see how to deduce Conjecture [3] from the above theorem. 
On the other hand, we can prove this conjecture separately (for p = 3), 
implying the previous theorem as well, in a simpler way, and without using 
any of the previous results or the disjointness of generators. (For i = 3 
(2) is easy, and the following theorem implies it for i = 2 and i = 1. For 
2k < n < 3k we will thus have two proofs of the optimality. (We still 
included the previous theorem because it forecasts our future difficulties: 
whenever the average degree of CONSTR(n, k) is much smaller than the 
maximum degree, "averaging arguments" do not easily work.) 

Theorem 4 If p = 3, that is 2k < n < 3k, then op(n, k) = opt(n, k) = 
constr(n. A;), furthermore, for any (not necessarily optimal) {n,k)- generator 
Q, (1) holds, and there exists z E [n] such that (2) holds. The construction 
is the unique {n,k)- generator. 

Proof. We prove, without requiring disjointness, that for any (n, k)- 
generator Q, there exists z G [n] such that (2) holds. 

We can suppose without loss of generality n = 2k + 1. Indeed, if n > 
2k + 1, then we can apply the proven assertion to the {2k + 1, A;)-generator 
g{U), where U C [n], \U\ =2k + l. 

Let Q be an (n, A;)-generator, and suppose for a contradiction |t/^(-z)| < 2 
for all z G [n]. 

We define an undirected graph G = {V, E) on V := [n] = [2A; + 1], in the 
following way: for each g £ G, \g\ > 2, we choose two vertices u,v £ g, let 
e = uv G E, and use the notation g^ for g. For (71 7^ 52 G ^ we can take the 
same u,v (if u,v £ gi Ci 52)) but then we take two parallel uv edges ei and 
62- We will say that the edge e = uv represents ge £ Q- We thus suppose 
that different sets in Q are represented by different edges. Furthermore, we 
suppose that we make the possible choices of u and v so as to minimize the 
number of components of G. 

Now it follows from the indirect assumption that all the degrees of the 
graph G are at most 2, so it is a disjoint union of cycles, paths and isolated 
vertices. The following Claim is the key of the proof: 

Claim: Let C be a cycle of G, and e an edge of C. Then e £ Q, and is not 
contained in any bigger set of Q. 

Indeed, by the definition of G, e is contained in a set of Q, so it is 
sufficient to prove that no set in Q can properly contain e. 

- If an extra element z of ge (different from the endpoints of e) of such 
a set were in C, then z would be contained in three different sets of 



15 



Q: ga and g^, where a,h are the two edges incident to z in C, and 
5e 5 e U {z}. Clearly, e, a, b are different, and therefore ge, ga-, % as 
well, contradicting the indirect assumption. 

- If an extra element z of ge (different from the endpoints of e) of such 
a set were in another component K of G, then replacing one of the 
endpoints of e by a point in g^, n K, we get another representation of Q 
with one less component (all vertices of C and K are now in the same 
component), contradicting the definition of G. 

The claim is proved. 

Let U be the set of vertices of G that are in a cycle. The subgraph 
Giy \ U) contains only paths and isolated vertices, so we can find a stable 
set (not containing both endpoints of an edge) S of G{V \ U) such that 
|5'| > \ U\/2. (We take a (the) bigger stable set in each component.) 

We show now that SUU cannot be /c-generated, contradicting the choice 
of Q. Recall that any g ^ g ^ S has also an edge in G. But the only 
edges in 5 U [/ are in the cycles, and for these the claim holds. Therefore 
what we have to show is exactly that U C/ is not the union of at most k 
edges of G or singletons. 

Indeed, denote ^{X) the minimum number of edges and singletons nec- 
essary for generating a set X C n. Let the components of G be Ci, . . . , Ct 
{t G IN). Note that for alH = 1, . . . , t : 7(C/n Q) > \Gi\/2. Then 

7(f/) = ^7(f/ n G.) > ^ IG.I/2 = > k. 

i=l i=l 

So U cannot be /c-generated, a contradiction. 

By lemma [U (that does not require disjointness), op(n,/c) = opt(n,A;) = 
constr(n, k) follows. 

When disjointness is required, by lemma [21 the contruction is the unique 
optimal (n, fc)-generator. □ 

5 Optimization and approximation 

The general problem this work is concerned with is natural to be asked in 
terms of combinatorial optimization, including also computational complex- 
ity and approximation ratios. In this section we would like to present our 
related observations: some negative results concerning the computational 
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complexity, and simple but surprisingly good estimates for the quantity 
opt(n, k). 

Two natural optimization problems arise: 

- We do not want to generate all cars, that is, all subsets of options, just 
a pre-given family. 

- The generator is restricted to choose elements from a given hyper- 
graph. 

More precisely: 

PROBLEM: CHOOSY CUSTOMER'S DIVERSITY 
Input: C C 'P([n]), numbers k,s. 

Question: Does there exist G Q V{[n\) that /c-generates all sets in C, and 
\G\ < s. 

PROBLEM: CONSTRAINED PRODUCER'S DIVERSITY 
Input: 7i C 'P([n]), number k and a target-set T C [n]. 
Question: Does there exist Q (ITC that A;-generates T ? 

Note that in this second problem we only speak about the existence of 
a generator. These are just two simple and natural variants that we choose 
for the sake of examples. The reader may enjoy stating his favorite variants 
and checking NP-completeness for them. 

Theorem 5 Both CHOOSY CUSTOMER'S and CONSTRAINED PRO- 
DUCER'S DIVERSITY problems are NP- complete. 

Proof. We first reduce VERTEX COVER to CHOOSY CUSTOMER'S 
DIVERSITY, and even to instances where k = 2. (VERTEX COVER and 
3DM below are proved to be NP-complete in Carey and Johnson's seminal 
book 

Let G = (y, E) be a graph, and consider the problem with input = 
V U {n}, where u is an extra vertex not in V , and C := {{f} : u E 0} U 
{{o, 6, u} : o, 6 G V, ah G E}. 

Clearly if T is a vertex cover, that is T n e 7^ for all e £ E, then 
G := {{v} : G 17} U {{t,u} : t G T} does 2-generate all C G C. Conversely, 
{{v} : V £ 0,} must be contained in all generators, and all the other sets can 
be supposed to contain u and to be of size 2. (Otherwise we can add u and 
keep only one of the elements different from u.) Let T := {v £ V : {v, u) G 
Q}. Then T is a vertex cover, finishing the proof of the first assertion. 
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Let us now reduce 3DM to CONSTRAINED PRODUCER'S DIVER- 
SITY. Let {U, V, W, E) be an instance of 3DM, that \s, E U V W 
(the Cartesian product of U,V,W), where \U\ = \V\ = \W\ = ?>k. Define 
T := U U V U W. Now clearly, Q C. E fc-generates T if and only if it is a 
3-dimensional matching (that is, if and only if it partitions T). □ 

In both proofs it is irrelevant whether we ask disjointness or not from the 
generators. (In these cases there exists always a disjoint optimal solution.) 

We now show that the construction provides a quite good approximation 
of the optimum. Enumeration provides the bound constr(n, k) < opt(n + 
2k, k). Let us sketch a proof of this. Given an {n,k) generator Q, all the 
2" — 1 nonempty subsets of [n] can be encoded by an at most k element 
subset of Q: 



> 2*^ 

=1 

It follows that k\g\^/kl > 2", that is, \g\'' > {k - 1)!2", and apply- 
ing Stirling's formula and taking the k-th. root: \g\ > t^'^ri/k^ 
opt(n, /c) > .^2"/''', while constr(n. A:) < fc2"/'^+const, which shows that 
constr(n, /c)/ opt(n. A:) does not exceed e{n,k)e where limn^k^oo^in, k) = 1. 
The exact treshold valid for all n and k is certainly smaller than 4: 
constr(n,A;) < 4opt(n, /c). Since constr(n -|- 2/c, A;) > 4constr(n, fc), we got 
that constr(n, k) < opt(n + 2k, k). 

For small A; we do not have to apply Stirling formula and we get essen- 
tially better bounds: for A; = 2, we get \Q\ + ('2') > 2" — 1 and we get the 
same bounds as in the theorems below. Still with the same method, for 
A; = 3 we get that the construction is at most --4= = 1,747- • • times the 
optimum. Let us deduce the results for A; = 2 with another method as well, 
which will also lead to a simple general proposition for arbitrary k: 

Theorem 6 For all n G M: 

opt(n,2) < constr(n,2) < 3/2opt(n,2), 

and the constant 3/2 can actually he improved to \f2 if n is even. 

Expressing opt(n, 2): C constr(n, 2) < opt(ri, 2) < constr(n, 2), with C = 
2/3 if n is odd, and C = V2/2 if n is even. 

Proof. Let Q be an (n, 2)-generator. Since every subset of [n] containing z 
is the union of a set in Giz) and a set in {Q — z) U {0}, we have : 
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The minimum of x + (x, y G R) under the condition xy = 2^ ^ is x = 

n—1 

y = 2~2-. Therefore, if in addition Q is an optimal (n, A;)-generator, then 

opt(n,2) = \g\ = \g{z)\+\g-z\ > mm{x+y-l : xy = 2'*-^} = 2'^+2^-l. 

n—1 n+l 

On the other hand, constr(n, 2) = 2^~ — 1 + 2^~ — 1 if n is odd, and 
constr(n, 2) = 22 — 1 + 22 — litnis even. □ 

If we compare constr(n, 2) with the same estimates applied to opt(n + 
1, A;) or opt(n + 2, A;), we get the following: 

Theorem 7 For all n G IN: opt(n, 2) < constr(n, 2) < opt(n + 1, 2), if n is 
even, and opt(n, 2) < constr(n, 2) < 3/4opt(n + 2, 2), if n is odd. 

Finally, we prove now with the same method a general statement which 
has only self-interest so far: for a hypergraph TL let 

a{H) := maxIS" C [n] : H D S is a singleton for all H G H}. 

Note that an (n, fc)-generator Q always satisfies a{Q) < k. On the other 
hand, for all n,k, a(CONSTR(n, fc)) = k. Conversely, a generator Q with 
<y{g) = k looks close to the optimum, and we can easily prove that it is 
optimal, if n = pk: 

Proposition 6 Suppose n = pk, opt(n — k,k) = constr{n — k, k), and that 
there exists an optimal {n,k)- generator Q, a{Q) = k. Then opt{n,k) = 
constr(n, /c) and CONSTR(n, /c) is the unique optimal {n, k)- generator, pro- 
vided the same holds for {n — k,k). 

If k = 2, the condition is a{g) = 2 and this means x,y E [n] such that 
G{x) and G{y) have no common elements. The proposition confirms all the 
conjectures under this condition (which is true for CONSTR(n, 2)). 

Let S" be a set that meets all members of H only in one element, IS"! = k. 
We will actually show that at least constr(n, k) sets are needed only to 
generate all sets in S U P{[n] \ S) and in P([n] \ S) ! 

Proof. Clearly, any set containing S is generated by exactly k sets, exactly 
one from each Q{s) {s e S). Thus 

l[\g{s)\>2-K 

seS 
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By the inequality between the geometric and arithmetic means, we have 
under this condition 

(ineqA) ^ \g{s)\ > kl"^ = k2P-\ 

The equahty holds in (ineq4) if and only if |^(s)| = 2~^, and the mem- 
bers of Us£sG{s) generate Hses 1^(^)1 sets; the latter condition holds if and 
only if any pair of sets from different G^s) are disjoint. 

Define for all s £ S, Ps := UG{s). Because of = 2~k~ we have 

\Ps\{s}\ > that is, 

E, „ , , ,n — k 
|P.| >fc(— ^ + 1) >n, 

and if there is equality in (ineq4) and therefore the sets Pg are pairwise 
disjoint, then there is equality everywhere, that is, \Ps\ = ^^^^ + 1 = n/k = p 
for all s G S. 

We have arrived now to our final estimation one ingredient of which is 
(ineq4), and the other is the obvious inequality |^ — /Sj > opt(n — A:, k). Then 

opt(n, k) = \g\ = \g - s\ + Yl 1^(^)1 ^ °Pt(" -k,k) + k2P-^ = 

= constr(n — k,k) + A;2^~^ = constr(n, k). 

So opt(n, /c) = constr(n. A;), and there is equality everywhere, so ^ — S" is 
optimal. If CONSTR(n — k, k) is the unique optimal (n — k, A;)-generator 
then Q — S \s isomorphic to CONSTR(n — k,k). Finally, applying Lemma [2] 
k times one by one to the elements of s in the role of z, we see that Q = 
CONSTR(n,A;). □ 

Conclusion: We proved that the most natural construction for an (n, k)- 
generator is optimal if n < 3k, and for some other individual pairs {n,k), 
regardless whether the disjointness of the sets is required, moreover, it is 
always a constant time approximation with a small constant. The natural 
formulations as an optimization problem are NP-hard. 
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APPENDIX: k = 2 and can we go further ? 



We deduce the conjecture for two more cases, also in order to provide another 
example of applying the arguments and assertions of the paper, and to realize 
the limits of some arguments. 

The following lemma extends the validity of Theorem [T] to the case when 
G — z can contain one more set besides subsets of the partition-classes. We 
restrict ourselves to the case k = 2 (the statement and its use seem to be 
considerably more complicated (even if not hopeless) for k > 2) : 

Lemma 3 Suppose Q is an (n, 2) -generator, {G — z) CI T'{Vi)[JV{V2)U {h} , 
where {{z},Vi,V2} {z G [n]) is a partition of [n], 2 < fj, := \Vi\ < IV2I 
(i = 1, 2), /i C y. Then \G{z)\ > 2^, in particular, Q is not optimal. 

Of course, we can suppose without loss of generality /i n / (i = 
1,2), otherwise h can be omitted from Q, and the assertion follows from 
Theorem [TJ 

Proof. If z sees Vi or V2 we are done, so we suppose it does not. 

Claim: For both i = 1 and i = 2, there is at most one subset of Vi that is 
not in Q{z) □ Vi. 

Suppose for a contradiction that the statement does not hold say for 
i = 2: let B ^ C O V2, B,C ^ g{z) U V2. Since z does not see Vi, there 
exists A(^Vi, g{z) U Vi. We show then \G{z)\ > 2l^il. 

The sets {z} D Au B, {z} D AuC must contain h that must participate 
in 2-generating these sets, whence 

{z} u{AuB)\ {h}, {z} u (A u c) \ {h} G g. 

We show now that |^(z)| > 2^^ , by labelling each subset of Vi with a different 
set in g{z). 

If U C Vi, U G g{z) nVi, we label U with an arbitrary g £ G{z), 
gOVi = U. For instance we label with {z}. If A ^ Giz) ^Vi, we saw that 
there exist two sets, {z} U{AuB)\ {h}, {z} U{AuC)\ {h} G Q. At most 
one of them is the label of A\{h}, the other, say {A U C) \ {h} is a priori 
not a label, since it meets Vi also in A\{h}, but it is not the label of this 
set. Let the label of A he {AU C)\ {h}. Clearly, the label of a different set 
A' C Vi, A' ^ Q{z) n Vi is different, since it is ^' U C \ {/i}, different from 
A\JC \ {h}. (Both A\JC and A' \J C contain h.) The claim is proved. 

The claim implies that |^(z)| > 2l^il — 1, but we are still fighting for the 
strict inequality here. Let 1 G /iPlVi, 2 G /iHl^. By Theorem [H z sees 
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Fi \ {1} and V2 \ {2} (since it does not see V2 and Vi). If it strongly sees 
both of them, then z U (Vi \ {!}), z U (V2 \ {2}) ^ G, and the only common 
element of these two is z, so the bound is largely satisfied. If not, then in 
Theorem [1] the equality is not satisfied, so there exists A CVi and f,g £ G 
such that A = fnVi = gnVi for f g e G, so the equality \G{z)\ = 2l^il - 1 
does not hold. □ 



Theorem 8 For any (not necessarily optimal) {7,2)-generator, (1) holds, 
and C0NSTR(7, 2) is the unique optimal generator. 

Proof. We first prove the second assertion. Let G be an optimal (7, 2)- 
generator. Then \G\ < constr(7, 2). Add to G some new sets to get a 
hypergraph G with \G\ = constr(7, 2) = 22. Obviously G is still a gen- 
erator. It suffices to prove now that G = C0NSTR(7, 2). Indeed, then 
C0NSTR(7, 2) = G = G follows since G does not contain any other genera- 
tor properly. 

Let d := l/'raX]a;e[n] ^(^) ^e the average degree of G- Clearly (as before, 
see Proposition [1]): 

n 

{ineqb) dn = |^(2;)| = \g\ = G^- 

x&[n] g^g i=l 

Claim 1: d>6 

We already know \G^\ > 22 and therefore |^^| > 15 as well. At the other 
extreme |^^| > 1 is obvious, let A £ G^- We show \G^\ > 5. 

- If there exists z ^ A, \G'^{z)\ > 2, then apply Proposition [3] after 
deleting z: \G^ ~ -^1 ^ t{G'^ — z) > 2. But this bound is self-improving: 
1^1 > 4, so ^ is not disjoint of the other set in G^ — z, and therefore 
1^^ -z\> t{G^ - z) -M > 3. But then \G'^{z)\ + \G^ - z\ >2 + = 

- If there exists z G A, \G'^{z)\ > 3, then similarly, apply simply \G^—z\ > 
t{G^) - 2 > 2 to get 1^3(^)1 + 1^3 _ ^1 > 3 + 2 = 5. 

- One of the preceding cases holds, because otherwise every z G [n] is 
covered by at most one member of G^ \ {A}, although there are at 
least 3 sets of size at least 3 in this hypergraph on 7 elements. 

We conclude now the proof of the claim by (ineq5): 
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^ ^ 22 + 15 + 5 + 1 43 ^ 
a > = — > 6. 



According to the claim there exists x E [n], |^(x)| > 7, that is, |^ — x| < 
22 — 7 = 15 = constr(6, 2) + 1. If the strict inequality holds, we are done by 
Lemma [21 so we can suppose |^(x)| = 7. 

Now Proposition [1] can be applied for n = 6,k = 2,p = 3: there exists 
z G [n], g{z) - X > 2P-1 + 1. So \g - {x,z}\ < constr(5,2), and the 
equality holds here by Theorem [2j Now Theorem [1] can be applied to deduce 
that z strongly sees the class of size 2 of ^ — {x,z}, since m = 3. So 
g — X contains a hypergraph isomorphic to C0NSTR(6, 2), meaning that it 
is exactly C0NSTR(6, 2) and one more element h. We conclude now the 
second part of the theorem with Lemma [3] substituting z for x. 

Let now Q be an arbitrary (7, 2)-generator. By the already proven part 
we have (1) for i = 1 and i = 2. It is also obvious for z = 4; as above, 
denote A £ G'^. In exactly the same way as we proceeded above, we can get 
l^^l > 5, after which it is still possible to do one more self-improving step, 
to prove l^^l > 6 = constr^(7, 3), as claimed: 

Suppose for a contradiction |^'^| < 5. A set T e G, \T\ = 3 will be called 
a triangle. 

Claim 2: If > 3 then G"^ — z has exactly two disjoint triangles, and 

these partition [n] \ z. 

Indeed, \g^ - z\ > T{g^ - z)>2, and if one of these two inequalities is 
strict, then we arrive at the contradiction 5 > |^'^| = + — z| > 

3 + 3 = 6. 

The average degree of is at least 4+3+3+3+3 ^ ^Q/^ > 2. So there 
exists z £ Q, \Q'^{z)\ > 3, and Claim 2 can be applied. Let Ti and T2 be the 
two triangles of — z provided by Claim 2. Since A is not a triangle, it 
does not coincide with any of these, so z £ A. Let T3 / T4 G Q{z) \ {A}. 

Claim 3: T3 n r4 = {z}. 

Indeed, another common element of T3 and T4, denote it by x, would 
also be contained in Ti or T2, say Ti. Then x G Ti n T3 n T4, and also x £ A, 
since if not, A with x ^ A £ \s not a triangle, contradicting Claim 2. But 
then > 4, — x| > t{Q^ — x) > 2, contradicting the assumption 

5 > |a^|(> 4 + 2 = 6). 
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It follows that A meets one of T3 and T4 and not only in z. Indeed, 

\A \ {z}\ + \T3 \ {z}\ + |r4 \ {z}\ > 3 + 2 + 2 = 7 > 6 = IN \ {z}\. 

Let this element be x S ^ n T3 \ {z}; since x G Ti U we can assume for 
instance x G Ti. 

Now again, Claim 2 can be applied to x, and since \ Q^\ < 5, both triangles 
of ^ — X are already among the listed sets. These can be only T2 and T4, in 
particular T4 is also a triangle. 

So T4 = (Ti \{x})U{2:}. Because of Claim 3, T3 contains, besides x G Ti 
also an element of T2. Finally, A = {x, z} U {T2 \ T^), since any other element 
in A would again contradict Claim 2. It follows that G'^ = {A}. In order to 
2-generate {1,2,3,4,5,6,7} itself, we need a set in Q'^ and its complement. 
But the complement of A is different of all of Ti, T3, r4, so has a sixth 
element, and this final contradiction finishes the proof of the theorem. □ 

Corollary 2 For any (not necessarily optimal) (8, 2) -generator, then (1) 
holds, there exists z G [n] such that (2) holds, and CONSTR(8,2) is the 
unique optimal [8, 2) -generator. 

Proof. \Q'^\ > 2 is obvious as usually (since each 7 element set still contains 

5 e a, \g\ > 4). 

- z\> 8constr3(7,2) = 48, 

ze[n] 

and every set of has been counted at most 5 times in this sum, so > 
[48/5] = 10 = constr3(8,2). 

It is now easy to prove \Q'^\ > constr^(8, 2) (and the same l^"*^! > 
constr^(8, 2)), with the same argument as in the proof of the previous the- 
orem: it suffices to proof Q with \Q\ = constr(8,2) = 30 is nothing else but 
CONSTR(8,2), and for this it suffices to prove that Q has an element of 
degree 2^ = 8. So the only remaining assertion to prove is that for any 
(8, 2)-generator with \Q\ = constr(8, 2) = 30 there exists z G [n] such that 
(2) holds. Then the last assertion also follows by Lemma EJ Let Q be such 
an (n, /c)-generator. 

Let d := ^/nJ2xe[n] ^(^) be the average degree of Q. Clearly (as before, 
see Proposition [1]): 

n 

d = l/nY^ \g{x)\ = l/nJ2\9\ = 1/^X^1^*1 = 1/8(30 + 22 + 10 + 1) > 7, 

xe[n] g^g 1=1 
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finishing the proof of the corohary. 



□ 



Note that for this last statement a much weaker bound is sufficient, 
namely the first easy estimate of without the worksome one. 

In C0NSTR(8, 2) the average degree is equal to the maximum degree and 
the same could be proved for the optimum generator, that is why Corollary [2] 
includes Conjecture [3l The same can be proved for arbitrary even n and 
k = 2, but the odd n case with "small" average degree remains open. 
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